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Abstract 

In this paper, we analyze the fundamental tradeoff of diversity and multiplexing in multi-input 
multi-output (MIMO) channels with imperfect channel state information at the transmitter (CSIT). We 
show that with imperfect CSIT, a higher diversity gain as well as a more efficient diversity-multiplexing 
tradeoff (DMT) can be achieved. In the case of multi-input single-output (MISO)/single-input multi- 
output (SIMO) channels with K transmit/receive antennas, one can achieve a diversity gain of d{r) = 
K{\ — r + Ka) at spatial multiplexing gain r, where a is the CSIT quality defined in this paper For 
general MIMO channels with M (M > 1) transmit and N (N > 1) receive antennas, we show that 
depending on the value of a, different DMT can be derived and the value of a has a great impact on 
the achievable diversity, especially at high multiplexing gains. Specifically, when a is above a certain 
threshold, one can achieve a diversity gain of d{r) = AIN{1 + MNa) — (Af + N — l)r; otherwise, 
the achievable DMT is much lower and can be described as a collection of discontinuous line segments 
depending on M, N, r and a. Our analysis reveals that imperfect CSIT significantly improves the 
achievable diversity gain while enjoying high spatial multiplexing gains. 
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I. Introduction 

The performance of wireless communications is severely degraded by channel fading caused 
by multipath propagation and interference from other users. Fortunately, multiple antennas can 
be used to increase diversity to combat channel fading. Antenna diversity where sufficiently 
separated or different polarized multiple antennas are put at either the receiver, the transmitter, 
or both, has been widely considered On the other hand, multi-antenna channel fading 

can be beneficial since it can increase the degrees of freedom of the channel and thus can provide 
spatial multiplexing gain. It is shown in |l3l that the spatial multiplexing gain in a multi-input and 
multi-output (MIMO) Rayleigh fading channel with M transmit and N receive antennas increases 
linearly with min(M, A^) if the channel knowledge is known at the receiver. As MIMO channels 
are able to provide much higher spectral efficiency and diversity gain than conventional single- 
antenna channels, many MIMO schemes have been proposed, which can be classified into two 
major categories: spatial multiplexing oriented (e.g.. Layered space-time architecture JH), and 
diversity oriented (e.g., space-time trellis coding Q, |l6l, and space-time block coding |f8l). 

For a MIMO scheme realized by a family of codes {C(p)} with signal-to-noise ratio (SNR) p, 
rate R{p) (bits per channel use), and maximum-likelihood (ML) error probability Ve{p), Zheng 
and Tse defined in [19|| the spatial multiplexing gain r as r = limp^oo £^ and the diversity gain 
(i as (i = — limp^oo Under the assumption of independent and identically distributed (i.i.d.) 
quasi-static flat Rayleigh fading channels where the channel state information (CSI) is known 
at the receiver but not at the transmitter, for any integer r < min(M, A^), the optimal diversity 
gain d*{r) (the supremum of the diversity gain over all coding schemes) is given by [HI 

d*{r) = {M -r){N ~r) (1) 

provided that the code length L > M + N — 1. The diversity-multiplexing tradeoff (DMT) in ([T]) 
provides a theoretical framework to analyze many existing diversity-oriented and multiplexing- 
oriented MIMO schemes. It indicates that the diversity gain cannot be increased without pe- 
nalizing the spatial multiplexing gain and vice versa. This pioneering work has generated a 
lot of research activities in finding DMT for other important channel models [fT0|-[IT3| and 
designing space-time codes that achieve the desired tradeoff of diversity and multiplexing gain 
[fT4l - [fT6l . The DMT analysis was extended to multiple-access channels in [lOJ. The automatic 
retransmission request (ARQ) scheme is shown to be able to significantly increase the diversity 
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gain by allowing retransmissions with the aid of decision feedback and power control in block- 
fading channels [11]. The work in [12] investigated the diversity performance of rate-adaptive 
MIMO channels at finite SNRs and showed that the achievable diversity gains at realistic SNRs 
are significantly lower than those at asymptotically high SNRs. The impact of spatial correlation 
on the DMT at finite SNRs was further studied in [I13II . 

It is natural to expect that the DMT can be further enhanced through power and/or rate 
adaptation if the transmitter has channel knowledge. If the CSI at the transmitter (CSIT) is 
perfectly known, there will be no outage even in slow fading channels since it is always able 
to adjust its power or rate adaptively according to the instantaneous channel conditions. For 
example, it can transmit with a higher power or lower rate when the channel is poor and a lower 
power or higher rate when the channel is good. However, in practice the CSIT is almost always 
imperfect due to imperfect CSI feedback from the receiver or imperfect channel estimation at 
the transmitter through pilots. The work in [[TtII showed that the transmitter training through 
pilots significantly increases the achievable diversity gain in a single-input multi-output (SIMO) 
link. In [18], the authors quantified the CSIT quality as a = — log (Xg / log p, where is the 
variance of the CSIT error, and showed that using rate adaptation, one can achieve an average 
diversity gain of d{a,f) = (1 + a — f)K in SIMO/MISO links, where K = ma.x{M,N) and 
f is the average multiplexing gain. Note that setting a = 1 and ignoring the multiplexing gain 
loss due to training symbols directly yields the result in ifTTl . For general MIMO channels, the 
achievable DMT with partial CSIT is characterized in |[T9l , where the partial CSIT is obtained 
using quantized channel feedback. 

In this paper, we analyze the fundamental DMT in MIMO channels and show that with power 
adaptation, imperfect CSIT provides significant additional diversity gain over ([T]). The imperfect 
CSIT considered in this paper is due to channel estimation error at the transmitter side. In the 
case of MISO/SIMO channels, we show that with power adaptation (under an average sum 
power constraint), one can achieve a higher diversity gain than that with rate adaptation in [18], 
where the authors assumed peak power transmission and thus no temporal power adaptation is 
considered therein. Specifically, we prove that with a CSIT quality a, the achievable diversity 
gain is d{r) = K (1 — r + Ka). It has been shown in our earlier work [20] that this is actually 
the optimal DMT in SIMO/MISO channels with CSIT quality a. For general MIMO channels 
(M > 1, > 1), we show in this paper that depending on the value of a, different DMT 
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can be derived and the value of a has a great impact on the achievable diversity, especially at 
high multiplexing gains. Specifically, when a is above a certain threshold, one can achieve a 
diversity gain of d{r) = MN{1 + MNa) ~ {M + N - l)r; otherwise, the achievable DMT 
is much lower and can be described as a collection of discontinuous line segments depending 
on M, A^, r and a. It is noted that an independent and concurrent work recently reported in 
[|2T| shares some similar results. However, we wish to emphasize that our CSIT model and the 
involved analysis towards the achievable DMT are different from those in [|2T|. The noisy CSIT 
therein is based on the channel mean feedback model in [l22| and an example of obtaining CSIT 
through delayed feedback is provided, whereas the CSIT in our work is estimated from reverse 
channel pilots using ML estimation at the transmitter. As the variance of the channel estimation 
error is inversely proportional to the pilots' SNR ll23l . the CSIT quality a is naturally connected 
to the reverse channel power consumption and any value of a can be achieved by scaling the 
reverse channel transmit power. In addition, our paper provides detailed closed-form solutions to 
the achievable DMT, which offers great insight and depicts directly what the DMT curve with 
imperfect CSIT looks like. 

Notations: IZ^ denotes the the set of real A^-tuples, and IZ^^ denotes the set of non-negative 
A^-tuples. Likewise, C^^^^ denotes the set of complex N x M matrices. For a real number 
X, denotes max(x,0), while for a set C C TZ^, C+ denotes O fl TZ^^. denotes the 
complementary set of O and denotes the empty set. \0\ denotes the cardinality of set O. 
X G (a, b] denotes that the scalar x belongs to the interval a < x < b. Likewise, x E [a, b] is 
similarly defined. CJ\f{0, a"^) denotes the complex Gaussian distribution with mean and variance 
cr^. The superscripts * and 1^ denote the complex conjugate and conjugate transpose, respectively. 
II ■ \\p denotes the matrix Frobenius norm and 1^ denotes the N x N identity matrix. E{-} 
denotes the expectation operator and log(-) denotes the base-2 logarithm, /(p) = p'' denotes that 
b is the exponential order of /(p), i.e., limp_>oo log(/(p))/log(p) = b. Likewise, < is similarly 
defined. Finally, for matrix A, A ^ denotes that A is positive semidefinite; if y is used with 
a vector, it denotes the componentwise inequality. 

The rest of this paper is organized as follows. In section II, we describe the channel model. 
In section III, we propose a power adaptation scheme based on imperfect CSIT and present the 
main result on the achievable DMT. The achievability proof of the presented DMT is given in 
Section IV. Section V provides some discussions. Finally, Section VI concludes this paper. 
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II. Channel Model 

We consider a point-to-point TDD wireless link with M transmit and receive antennas, 
where the downlink and uplink channels are reciprocal. Without loss of generality, we assume 
M > in this paper. As shown in f9|, this assumption does not affect the DMT result. We 
also consider quasi-static Rayleigh fading channels, where the channel gains are constant within 
one transmission block of L symbols, but change independently from one block to another. We 
assume that the channel gains are independently complex circular symmetric Gaussian with zero 
mean and unit variance. The channel model, within one block, can be written as 

Y = ^P/MUX + W (2) 

where H = {hn-m} G C^^*^ with hn,m, rn = 1, 2, . . . , M, n = 1, 2, . . . , A^, being the channel 
gain from the m-th transmit antenna to the n-th receive antenna; X = {X^ i} E C*^^^ with 
Xm,i, m = 1,2,..., M, / = 1, 2, . . . , L, being the symbol transmitted from antenna m at time 
/; Y = {Yn,i} G C^""^ with Yn,i, n = 1, 2, . . . , iV, / = 1, 2, . . . , L, being the received signal at 
antenna n and time /; the additive noise W G C^^^ has i.i.d. entries Wn,i ~ CJ\f{0, cr^); P is the 
instantaneous transmit power while the average energy of Xm,i is normalized to be 1. Letting 
P denote the average sum power constraint, we have E{P} = P. So, the average SNR at the 
receive antenna is given by p = P/cr^. 

We assume that the receiver has perfect CSI H G C^^^, but the transmitter has imperfect 
CSIT H G C^^*^, which is estimated from reverse channel pilots using ML estimation. Thus, 
H can be modeled as ll23]l-[|25l 

H = H + E (3) 

where the channel estimation error E G C^^^ has i.i.d. entries En^m ~ CA/'(0,crg), n = 
1,2, . . . , N, m = 1,2, . . . , M, and is independent of H. The quality of H is thus characterized by 
cTg. If cTg = 0, the transmitter has perfect channel knowledge; if a"^ increases, the transmitter has 
less reliable channel knowledge. We follow ifTSl to quantify the channel quality at the transmitter. 
The transmitter is said to have a CSIT quality a, if = p^". The definition of a builds up 
a connection between the imperfect channel knowledge at transmitters and the forward channel 
SNR, p. Since the variance of the channel estimation error is inversely proportional to the pilots' 
SNR, i.e., cTg oc {SNRpUot)~^ ll23l . any value of a can be achieved by scaling the reverse channel 
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power such that SNRpHot = p°. One can see that the selection of a value actually determines 
the cost of obtaining CSIT in terms of the reverse channel power consumption. When o; = 0, 
the reverse channel SNR does not scale with p, which means that the pilot power is fixed or 
limited; when < « < 1, the reverse channel SNR relative to p is asymptotically zero; when 
a = 1, the reverse channel SNR scales with p at the same rate; when a > 1, the reverse 
channel SNR as compared to the forward channel SNR, p, is asymptotically unbounded [[T8l . In 
the sequel, we will study how the pilot power, or equivalently the CSIT quality a, affects the 
fundamental tradeoff of diversity and multiplexing in the considered channel. Before presenting 
our main results, we give the following probability density function (pdf) expressions and some 
preliminary results that will be used later. 

For m N X M (N < M) random matrix A with i.i.d. entries ~ CAf{0, 1), let < Ai < A2 < 
... < A AT denote the ordered nonzero eigenvalues of AA^ Letting t>„ denote the exponential order 
of 1/Xn for all n, the pdf of the random vector v = [vi, ...,vn] is given by [|26l 



N N / ^ \ 

p(v) = jiim r'llogp)"^ n p-^''~^-^^>- nip""" - p-'O'exp - p-'- 

n=l j>'n. \ n=l / 

(4) 

0, for any u„ < 

niLiP"^'""'^^^"^^"", for all Vn>Q 
where ^ is a normalizing constant. Hence, the probability Vo that (vi, ...,vn) belongs to set O 
can be characterized by 

N 

Vo = p~'^° , for do= inf V (2n - 1 + M - N)vn (5) 

ivi,...,VN)eo+ 

provided that C+ is not empty. 

Letting a = [oi, 02..., a^], < ai < a2 < ■■■ < aj^, b = [61, b2..., bj\f], < bi < b2 < ■■■ < b^, 
and c = [ci, C2..., Cat], < ci < C2 < ... < cat, denote the eigenvalue vectors of HH^ HH^ and 
EE^ respectively, the pdfs of a, b, and c can be shown to be 

N \ 

~ (6) 



N N / ^ \ 

p(a) = n n («n - exp - ^ aj 

n=l n<j \ n=l / 

p(b) = n c-^ n - b,r exp --^ 6„ (?) 

n=l n<j \ ^ ' n=l / 

P(C) = n Cn'-^ n - exp -- ^ C J (8) 

n=l n<j \ e n=l / 
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where = r'(l + a^)-*^^ and = r^^-'^-*'^ 



III. Main Result on DMT 

The ML error probability Ve{p) of the channel described in Q is closely related to the asso- 
ciated packet outage probability Tout, which is defined as the probability that the instantaneous 
channel capacity falls below the target data rate R{p). In fact, the error probability of an ML 
decoder which utilizes a fraction of the codeword such that the mutual information between 
the received and transmitted signals exceeds LR{p) (no outage), averaged over the ensemble of 
random Gaussian codes, can be made arbitrarily small provided that the codeword length L is 
sufficiently large ETl . We will thus leverage on the outage probability to examine the achievable 
diversity gain. If the transmitter has perfect CSIT, it may adopt the optimal power adaptation 
according to the actual instantaneous channel gain such that no outage will occur. With only the 
imperfect CSIT, in order to mitigate the channel uncertainty, we propose the following power 
adaptation scheme. 

Proposition 1: Given H, the transmitter transmits with power 

K P 

^(H) = , ^ u (9) 

where k = illn=i K'^^ - 1 + M - A^)(l - t)] and t (0 < t < 1) can be chosen arbitrarily close 
to 1. 

It is shown in Appendix A that the above power adaptation scheme satisfies the sum power 
constraint £'{P(H)} = P. We believe that given the CSIT quality of a, this power adaptation 
scheme is the optimal power adaptation scheme that maximizes the achievable diversity gain of 
a MIMO fading channel. 

Theorem 1: Consider a MIMO channel with M transmit and receive antennas (M > A^) 
and CSIT quality of a. If the block length L > M + A^ - 1, the achievable DMT using the 
power adaptation scheme in Proposition 1 is characterized by 

Case 1: If A^ = 1 or a > then 

d{r) = MN{1 + MNa) - {M + N -l)r. (10) 
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Case 2: Otherwise, the achievable DMT is a collection of discontinuous line segments, with 
the two end points of line segment dk{r) (k E B) given by 

Left end: 4(r) = k{M -N + k)T{k), for r = {N - k)T{k) 

Right end: 4(r) = {{N - k){k - N - I) + MN)T{k) - {2k - I + M - N){N - I{k))T{I{k)), 

for r = (A^- J(A;))r(J(A;)) 

(11) 

where 

B = [k\{M - N + k){N - k) < l/a,{N - k)T{k) < {N~k)T{k),yk <k,k = 1,...,n], 

T{k) = 1 + ka{M — N + k) and I{k) = max^gg_^<;. k. 

For example, when M = N = 2 and a < 1, the DMT curve consists of two discontinuous line 
segments which are (0, 16a + 4) — (1 + a, 13a + 1) and (1 + a, 1 + a) — (2, 2a). When r = 1 + a, 
the achievable diversity gain is d{r) = 1 + a instead of 13a + 1. From Theorem 1, we can get 

d{0) = MN{1 + MNa) and d{N) = pa{M - N + p){MN + {p - N){N - p + 1))) - + p 
where p = min^g/? k. If a < (jv-i)(M-Ar+i) ' which indicates 1 G i3, we will have d{N) = 
aN{M - N + 1)^. 

IV. Proof of Theorem 1 

The proof involves the computation of the asymptotic ML error probability at high SNRs. We 
will first derive a lower bound of the SNR exponent of the outage probability, denoted as do{r), 
and then show that using a random coding argument the SNR exponent of the error probability 
is no less than do{r) if L > M + N - 1. 

A. Derivation of do{r) 

Optimizing over all input distributions, which can be assumed to be Gaussian with a covariance 
matrix Q without loss of optimality, the outage probability of a MIMO channel with transmit 
power P(H) is given by 

Tout = inf , V flogdet (l^ + ^HQHt) < R{p)\ (12) 

where V{-) denotes the probability of an event. It is shown in [9] that one can get an upper bound 
and a lower bound on the outage probability by picking Q = Ia/ and Q = MIm, respectively. 
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and the two bounds converge in the high SNR regime. Therefore, without loss of generality, we 
consider Q = Im- Substituting ^ in (fT2l) . we have 



V, 



out 



V logdet + 



r (log n f 1 + — 



pKttr, 



HHt^ < R{p)^ 

-) < Rip)) ■ 



(13) 



tN i(2n-l+M-N)t 
ln=l 

Lemma 1: The eigenvalues of Hh\ HH^ and EE^ have the following relationship 

fen <2(a„ + c^), n = l,2,...,Ar. (14) 

Proof: We obviously have the following equality 

(H + E)(H + E)'^ + (H - E)(H - E)^ = 2(HH1' + EE^) (15) 

where both (H + E)(H + E)t and (H — E)(H — E)''" are positive semidefinite matrices. We denote 
the vector of eigenvalues of (HH^ + EE^) as d = [di, cLn] with di < d2 < ... < d^- Since the 
eigenvalues of the sum of two positive semidefinite matrices are at least as large as the eigenvalues 
of any one of the positive semidefinite matrices [[281, we have fe„ < 2dn, n = 1,2,...,N. 
Further, using the relationship of the eigenvalues of the sum of Hermitian matrices, we get 
an + ci < dn < an + cn, n = 1, 2, A^. It thus directly leads to (fT4|) . ■ 
With Lemma 1, the outage probability is upper bounded by 



N 



log n 1 



n=i V Mnti(2a„ + 2c^)(2-i+M-iv) 
Let Vn and u„ denote the exponential orders of l/a„ and l/c^, respectively, i.e., f„ 



(16) 



limp^oo 1^^- Using ©, © and the pdfs of the random vector 



V = [f 1, Vn] and u = [ui, un] can be shown to be 

0, for any f „ < 

nn=iP"('""'+*'^'^^"", for all Vn>0 



p{\) 



p(u) 



0, for any m„ < a 

nil p-(2«-i+M-^){«.-"), for all Un > a. 
At high SNRs, with ([IT]) and ([T8]), becomes 



(17) 



(18) 



Vout < V 



N 



El 



n=l 



N \ 

t{2n -1 + M - N) min(w„, un) 

n=l J 



< r 



(19) 
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So, the outage event O in (fT9l) is the set of {f i, . . . , f at, Mi, . . . , mat} that satisfies 

N / N 



N / N \ 

^ 1 - + 5^ t(2n -1 + M-N) minK,^^) 

n=l \ n=l / 



< r (20) 



where fn > 0, m„ > a > 0, n = 1, 2, A^. 
According to (|5]), we have 

Vout <Vo = p-"^^'') (21) 
where do^r) serves as a lower bound of the SNR exponent of Vout and is given by 

N 

do{r)= inf V(2n - 1 + M - A^) K + u„ - a) . (22) 

{v-i,...,vm,ui,...,um)€0 

Next, we work on the explicit expression of do{r). Since the left hand side (LHS) of (|20l ) 
is a non-decreasing function of ua^, decreasing un will not violate the outage condition in (|20|) 
while enjoying a reduced SNR exponent X]^=i (2?^^ — 1 + M — A^) (f„ + u„ — a). Combining with 
the fact M„ > a, n = 1, 2, iV, the solution of u is found to be = ... = = a. Therefore, 
(|20|) can be rewritten as 







/ AT 




E 


l-Vn + Yl ^(2^ - 1 + M 




n=l 


\ n=l 



To solve the optimization problem of (|22l) . we need to solve the subproblems 



A 



4(r) = inf V(2n-1 + M-Ar)t;„, A; = 0,l,...,Ar (24) 

(''iv,fjv)eCfc , 



^ n=l 

where subset Ok (0 < k < N) is defined as 





N 




N 


Ok = l^Vn 






h M - N)a + ^(2^ - 1 + M 




n=l 


\ n=l 


n=fc+l 



^^1 > ••• > ^^fe > a > ffe+i > ••• > vn} ■ 
So, do{r) is given by 

(ic)(r) = min {do{r), di{r),..., dN^r)) . (25) 

In other words, among all the DMT curves dQ(r),...,dNir), corresponding to the outage subsets 
Oi,...,On, the lowest one will be the DMT curve for the entire outage event. Since t can be 
made arbitrarily close to 1, it is without loss of accuracy to set t = 1 in the rest of this paper. 
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Firstly, we derive do{r). It is easy to show 

N / N \+ N N 

J2 l-t;„ + ^(2n-l + M-Ar)t;„ > A^-^ y„+A^ ^(2n-l + M-Ar)t;„ >N (26) 

n=l \ n=l I n=l n=l 

which suggests that it is possible to operate at spatial multiplexing gain r G [0, A^] reliably 
without any outage, i.e., do(r) = oo. So we can exclude do(r) from the optimization problem 
in (125]). 

Secondly, we derive 4(r) (1 < A; < A^). Note that the function XI^Li (l " + ELi(2^ - 1 
+M — N)a + Z]^=fc+i(2^ — 1 + M — N)vn^^ is an increasing function of Vk+i, ffc+2, v^. 
That is, decreasing Vk+i, Vk+2---,VN does not violate the outage condition for Ok, while reducing 
the SNR exponent — 1+^— ^)^^n- Therefore, the optimal solutions of ffc+i, ■■■,vn 

are vl^^ = vl^2 = ■■■ = v^^ = 0. Consequently, the optimization problem in (|24|) can be 
reformulated as 

k 

dk{r)= inf _ ^(2n - 1 + M - iV)?;„, A; = 1, iV. (27) 

(i>i,...,i)fe)edfc 

Here the modified outage subset Ok is defined as 



Ok = l^^i, Vk NT{k) -J2^Vn<r,a<Vk<...<Vi< r(fc)| . (28) 
where r(A;) = 1 + ka{M — N + k). Careful observation of (|28] ) reveals that 

k 

NT{k) -J2vn> (N - k)T{k). (29) 

n=l 

It implies that there will be no outage (dk{r) = oo), if r < (iV — k)T(k) or (A^ — k)T{k) > N. 
Note that {N-k)T{k) >N^ {M -N + k){N -k) > 1/a. So, if {M-N + k){N-k) < 1/a, 
there will be nonzero outage {dk{r) < oo), for r G where f2/c is defined as 

ilk = i{N-k)T{k), N]. (30) 

For any r G ^7^^ we are able to explicitly calculate the optimal solutions of Vi,...,Vk that 
minimize the SNR exponent I]n=i(2'^ — 1 + M — N)vn- The results are summarized in the 
following. 

1) If r = (A^ — k')T{k) — {k — k')a, k' = 1, 2, k, then the achievable diversity for outage 
event Ok is 

4(r) = k'{M -N + k')T{k) + {k - k'){k + k' + M - N)a. (31) 
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The corresponding optimal solutions of vi, ...v^ are vl = ... = v^, = rik), vl,j^^ = ... = 
vl = a. Specifically, 4(r) = k{M -N + k)T{k) for r = {N - k)T{k). 
2) If {N - k')T{k) -{k- k')a < r < {N - k' + l)r(A;) - {k - k' + l)a, k' = 1, 2, k, the 
achievable diversity for outage event Ok is 

4(r) = ((N-k')ik'-N-l)+MN)T{k) + {k-k' + l){k-k')a-{2k'-l+M-N)r. (32) 

The corresponding optimal solutions of vi,...,Vk are = ... = Vk'-i = T{k), v^i = 
{N -k' + l)T{k) - (/c - k')a - r, v*^,^-^ = ... = vl = a. 

For a particular k' , when spatial multiplexing gain r is between (A^ — k')T(k) — (k — k')a and 
(N — k' + 1)t(A;) — {k — k' + l)a, only one singular value of H, corresponding to the typical 
outage event, needs to be adjusted to be barely large enough to support the data rate. (|32|) further 
shows that dk{r) is a continuous line segment between these two points. It is thus obvious that 
curve dk{r) is piecewise-linear with (r,dk{r)) specified in (|3T1) being its corner points. 

After calculating di{r), dN{r), we remain to solve do{r) = min^ dk{r), k = 1, N. Since 
dk{r) = oo if (M — N + k){N — k) > 1/a, we only need to consider k E A, where set A is 
defined as 

^= {A;|(M-iV + A;)(A^- A;) < l/a,k= 1,...,A^}. (33) 

Note that we always have k = N G A. We consider the following two cases. 
Case 1: A has only one element, i.e., A = {k = N}. In this case, we have (io(r) = (ijv(r). If 
= 1, this condition is naturally satisfied, since there is only one element in A that is k = 1. 
If > 1, we must require (M - + /c)(A^ - A;) > 1/a for /c = 1, A^ - 1, which leads to 

a > TT^, N>1. (34) 
- M- 1 

We now examine the comer points of d^^r). From (|3T1) . we have r = (N — k'){l + aMN — 
a) > 1 + MNa - a for corner point k' (k' = 1,2, A^ - 1). Since 1 + MNa - a is a 
non-decreasing function of a, we easily get r > 1 + — -jj^ > N. Thus we conclude 
that there is only one corner point (0,(iiv(0)) on curve dN{r) over region r G ^In- Therefore, 
doi^) = dN{r) is a straight line between comer points (0,(ijv(0)) and (N,dN{N)). From (|32l) , 
we have dN{N) = MN{1 + MNa) - {M + N - 1)N, so do{r) can be described as 

do{r) = MN{1 + MNa) - {M + N - l)r for < r < A^. (35) 
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Case 2: A has more than one element. Since N e A and — [0, A'^], Qfc (k N,k e A) 
overlaps with il^- That is, there are some regions of spatial multiplexing gain r, leading to finite 
diversity gains on different DMT curves. A straightforward method to find doir) is to numerically 
calculate dk{r) for all k E A, and choose the minimum value among them. However, this makes 
do{r) implicit and hardly insightful. To find the closed-form solution of do{r), we wish to find 
out if there is any relationship among di(r), dN{r). This motivates the birth of the following 
Lemma, the proof of which is given in Appendix B. 

Lemma 2: For any spatial multiplexing gain r e n^fe2 (1 < ^i, ^2 < N), if ki < k2, we 
have 4^(r) < dt^ir). 

This Lemma tells us if a spatial multiplexing gain r leads to finite diversity gains on two 
DMT curves, we only need to select the curve with lower diversity gain. For example, if r e 
^1 n^2 n ■■■ n^AT, then do{r) — o?i(r) since di{r) < d2{r) < ... < dN{r) < 00. Therefore, we 
can further expurgate bad k (s.t. flk Q ^k, fork<kEA) from A and only take into account 
A; e B for the optimization problem, where 

B = [k\{N - k)r{k) <{N- k)r{k), Vk < k,k,k e A} . (36) 

Letting \B\ denote the cardinality of B, we further divide r G [0, iV] into \B\ non-overlapping 
regions with region ^1^ (k e B) defined as 

Qk = ^kO^l, yk KkkkeB 

(37) 

= [{N -k)T{k),{N -X{k))T{X{k))) 

where T{k) indicates the immediately preceding element of k in B, i.e., T{k) — max^^j. j^gg k. 
From Fig. 1, which illustrates the relationship between and Q^, we get do{r) — dk{r) for 
any r e Ctk. 

Next we examine the comer points on curve dk{r) over r e Qfe and give the following Lemma, 
the proof of which is given in Appendix C. 

Lemma 3: For k E B, there is only one corner point, {{N — k)T{k), k{M — N + k)T{k)), 
making r e VL^. 
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As a result, dk{r) over r G is just a single line segment connecting the following two end 
points 

Left end: 4(r) = k{M -N + k)T{k), for r = {N - k)T{k) 
Right end: 4(r) = {{N - k){k - N - 1) + MN)T{k) - {2k - 1 + M ~ N){N - I{k))T{I{k)), 
for r = (A^- J(A;))r(J(A;)). 

(38) 

Finally, since doir) is the union of dk{r) over r G for all k E B, the DMT curve over the 
entire outage event is the collection of all the involved line segments and the two end points of 
line segment dk{r) (k G B) are described in (l38l) . It should be noted that these line segments 
are discontinuous though r is continuous between and N. Combining the above Cases 1 and 
2 directly leads to (flOl) and (fTTj) in Theorem 1. 



B. Achievability Proof 

To complete the proof of the Theorem 1, we need to show that Ve{p) < p^'^oir) if > 
M + N — 1. With the ensemble of i.i.d. complex Gaussian random codes at the input, the ML 
error probability is given by [|9|| 

r^{p) = VoV{enor\0) + P(error, O") <Vo + P(error, O") (39) 

where O and Vq are given by (l20l) and (l22l) . respectively. 

^(error, C^) can be upper-bounded by a union bound. Assume that X(0), X(l) are two possible 
transmitted codewords, and that AX = X(l) — X(0). Suppose X(0) is transmitted, the probability 
that an ML receiver will make a detection error in favor of X(l), conditioned on a certain 
realization of the channel, is 



V (X(0) ^ x(r 



H,H 



V 



'P(H) 



^H(AX) 



< llwl 



(40) 



where w is the additive noise on the direction of H(AX), with variance 1/2. With the standard 
approximation of the Gaussian tail function, Q{x) < 1/2 exp(— x^/2), we have 

P(H) 



P X(0) ^X(l) H,H < exp 



|H(AX)||^ 



(41) 
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Averaging over the ensemble of random codes, we have the average pairwise error probabiUty 
conditioned on the channel realization 



L 



V (X(0) ^ X(1)|h, h) < det {\^ + ^^HHt j . (42) 

With a data rate R = r log(p) (bits per channel use), we have in total p^** codewords. Applying 
the union bound, we have 

P(error|H, H) < p^'' det f I^v + ^^HrA 



2Ma^ 

N / \ -L 

1+M-N 



(43) 



Mnti(2a„ + 2c^)2n-i+M-iv^ 

^(Er=i(l-''"+ELi(2"-l+J^-^)mi>iK.«iv))^- 



n=l 

^iV 



= P 

Averaging over the distributions of H and H, or equivalently v and u, we have 
P(error, C'^) = / p(u)p(v)7^(error|H, H)(iu(iv 

-L. p-dair) 

where 

N 

dair)^ inf y"(2n - 1 + M - N)(vn + - a) 

n=l 

/ N / N V \ 

+ L \ - t^n + X](2n -1 + M-N) min(^;„,'U;v)j -rj . 

When L > M+N-1, dair) has the same monotonicity as En=i (l - + En=i(2ri - 1 + M 
—N)a + E^=fe+i(27T' — 1 + M — N)vn^'^ with respect to Vn or n — 1, N. Therefore, the 
minimum always occurs when E^=i (l — I'n + E^=i(2n — 1 + M — N) min(i;„, un)J~^ — r. 
Hence 

AT 

dG{r)= inf ^ Y.{2n - 1 + M - N){vn + Un - a) 

Er=i(l-''"+Er=i(2'»-l+^-^)mmK,nw))'^=rn=l (46) 

= rfo(r). 
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Therefore, the overall error probability can be written as 

= p-'^o^") +P(error,0'=) (47) 

Since the MIMO channel with the proposed power adaptation scheme leads to an error 
probability lower than or equal to p~"'o('), we can say that the MIMO channel is able to achieve 
the diversity gain of do{r). Theorem 1 is thus obtained. 

V. Discussions 

In this section, we discuss the additional diversity gain Arf(r) brought by the imperfect CSIT 
through power adaptation. 

Case A) iV = 1 (MISO/SIMO): According to GO]), the imperfect CSIT provides an additional 
diversity gain of Arf(r) = M^a at any spatial multiplexing gain in the considered MISO/SIMO 
channel. Most remarkably, when a = 1/M, one can achieve both full diversity gain (i.e., M) and 
full spatial multiplexing gain (i.e., 1) at the same time, while a has to be equal to or greater than 
1 to achieve the same performance in ifTSl . Note that the case of « < 1 is much more practical 
than the case of a > 1 as one usually allocates much less power to the reverse (feedback pilot) 
channel than the forward transmission channel. 

Case B) a > -jjzi, N > 1: For such MIMO channel, according to (flOl) . the additional 
diversity gain is Ad{r) = {W^N^a + r - r^), for r = 0, 1, A^. Specifically, Ad{0) = M'^N'^a 
and Ad{N) = aM'^N'^ + N - > MN'^ + N. If < r < N, the additional diversity gain is 
between the two extreme values Arf(O) and Ad{N). 

Case C) a < A^ > 1: When r = N, AdiN) = d{N) > di{N) = aN{M-N+iy. When 

r < N, for the convenience of comparison with dH, we consider integer spatial multiplexing 
gains, i.e., r = N — k, k = 1,2, N. Since r = N — k<{N — k)T(k), from Theorem 1, the 
achievable diversity gain is d{r) > k{M-N+k)T{k) = {M-r){N-r) (1 + a{M - r){N - r)). 
Recall that the optimal diversity gain without CSIT is d*{r) = (M — r)(N — r). Therefore, the 
additional achievable diversity gain with our scheme is Arf(r) > a{M —r^ {N —rY = a ((i*(r))^. 
It indicates that even a very small a leads to a significant diversity gain improvement. 

We use numerical results to show the additional diversity gain achieved with imperfect CSIT. 
We compare the following two scenarios: 1) No CSIT [9]; and 2) Imperfect CSIT with power 
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adaptation. Figs. 2 and 3 plot the DMT curves for 3x3 and 4x2 MIMO fading channels, 
respectively. It is obvious that imperfect CSIT provides significant additional diversity gain 
improvement and offers non-zero diversity gain at any possible spatial multiplexing gain. Fig. 2 
also shows the impact of a value. When a > = i, we only have d^ir) < oo and thus 
B = {3}. Therefore, the DMT curve is a single line segment. However, when a = | < jj^, 
B = {1, 2, 3}. Therefore, the DMT curve is made up of three discontinuous line segments. Fig. 3 
shows how do{r) depends on di{r) and d2{r) in a 4 x 2 MIMO channel with a = 0.1. We 
observe that d2{r) > di(r) and there is only one comer point on di{r) (or d2{r)) over spatial 
multiplexing gain region r E Cli (or r E (I2). 

Next we illustrate the impact of a on DMT. Fig. 4 plots the relationship between the achievable 
diversity gain and the channel quality a in a MISO/SIMO channel. It clearly shows that power 
adaptation makes better use of the imperfect CSIT than rate adaptation. In other words, to 
achieve the same performance our scheme saves a great amount of pilot power and thus is 
more applicable. Specifically, the diversity gain improvements over [|9l and [fTSll are M^a and 
(M — l)Ma, respectively, at any spatial multiplexing gain. It is no doubt that the achievable 
DMT increases with CSIT quality a. Fig. 5 plots how the achievable diversity gain with power 
adaptation improves with the channel quality a in a 5 x 3 MIMO channel at the full multiplexing 
gain. We observe that there are fast increases of diversity gain at a = 0.25 and a = 0.1667. These 
two values of a are actually thresholds for dk{r) < 00, k = 1,2,3. When a > = 0.25, 
B = {3}. Therefore, we have do{N) = d^^N). When 0.1667 <a< 0.25, we have B = {2,3} 
and do{N) = d2iN). When a < 0.1667, we have B = {1, 2, 3} and rfo(r) = di{N). Combining 
with the fact that di{r) < d2{r) < d^{r) for any fixed a, it is not difficult to understand the 
cliffs on this curve. 

Note that the additional diversity gain comes at the price of reverse channel pilot power to 
obtain the CSIT. As long as the reverse channel SNR does not scale with p, i.e., a = 0, even 
with some partial CSIT at the transmitter, there will be no improvement on the fundamental 
DMT. However, when the reverse channel SNR relative to p becomes asymptotically zero, i.e., 
a < 1, there will be a significant improvement of the diversity gain. When the reverse channel 
SNR as compared to the forward SNR is asymptotically unbounded, i.e., a > 1, one can achieve 
the full spatial multiplexing gain while enjoying a even more remarkable diversity. 
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VI. Conclusion 

In this paper, we investigated the impact of CSIT on the tradeoff of diversity and spatial 
multiplexing in MIMO fading channels. For MISO/SIMO channels, we showed that using power 
adaptation, one can achieve a diversity gain of d{r) — K{1 — r + Ka), where K is the number 
of transmit antennas in the MISO case or the number of receive antennas in the SIMO case. 
This is not only higher but also more efficient than the available results in literature. For general 
MIMO channels with M > 1 transmit and N > 1 receive antennas, when a is above some 
certain threshold, one can achieve a diversity gain of d{r) = MN(1 + MNa) — (M + — l)r; 
otherwise, the achievable DMT is a collection of discontinuous line segments depending on M, 
N, r and a. The presented DMT shows that exploiting imperfect CSIT through power adaptation 
significantly increases the achievable diversity gain in MIMO channels. 

Appendix A 

Letting = — log(6„)/log(p) for all n and q = [qi, q^], we have 

/• kP . ^ ^ / 1 ^ \ 

■''i^^ {U^^^p-(^^-^+^-^)l-) n=l n<j V l + ^e;^l / 

(48) 

At high SNRs, it is easy to show that 

„ _ / N \ (1-*) 

E{P{il)} = lim / KPC\logpf n dq = P. (49) 

JqtO Vn=l / 

Appendix B 

Let t'A;i,fci denote the solutions of vi, that minimize dk-^{r), and let Vi^k2i '^k2M 

denote the solutions of Vi, that minimize dkr^{r). Without loss of generality, we assume 

vi,ki = ■■■ = t^i-i.fci = T{ki),T{ki) > Vify > a, Vi+i,ki = ■■■ = Vk^^ki = « (50) 

Vim = ■■■ = Vj-i,k2 = T{k2),T{k2) > > a,Vj+iM = ... = Vk2M = ^- (51) 

It follows that the corresponding spatial multiplexing gain r satisfies 

r^{N-i + l)T(fci) - (fci - i)a - Vi,k, (52) 

r^{N-j + l)T{k2) - {k2 - j)a - (53) 
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which leads to 

{{N-J + l)T{k2) - ih - j)a - -{{N-t + l)T{h) - {k, - ^)a - v^^kA = 0. (54) 

We consider the following three cases. 
Case I) j < i: Letting B denote the LHS of dH]), we have 

B >{N - j)r(/e2) - {k2 - j)a - {N - i + l)r{k{) - {ki - i + l)a 

>{N - % + l)r(A:2) - (A;2 - i + l)a - (iV - 2 + l)r(A;i) - (fci - i + \)a (55) 

>(x{k2 - ki) {{N -i + l){M - N + k2 + ki) - 1) > 0. 
This contradicts with 5 = 0. Therefore, j < z is not possible. 
Case 2) j > i: It is easy to observe that 

Vim - = ••• = ^i-i,fe2 - Vi-iM = k2a{M - N + k2) - kia{M - N + ki) > 0(56) 

ViM - > k2a{M -N + k2)- kia{M - N + k{) > Q (57) 

Vi+iM - Vi+iM, ■■■.Vk^M - VkiM > a - a = (58) 

Wfci+i,fc2, ■■■,Vk2,k2 > a. (59) 
Then, it follows that 

fc2 kl 

dk2 (r) - 4i {r) = ^iM - ^i^k^ > 0. (60) 

i=l 1=1 

Case 3) j = i: Similarly, we have 

^^i,fc2 - Vim = ■■■ = ^i-iM - Vi-iM = haiM - iV + /ca) - kia{M - N + ki) > 0(61) 
Vi+i,k2 = ■■■ = Vk2,k2 = Vi+iM = ••• = Vk-iM = ot- (62) 
From ( |54l ). we get 

ViM - ViM = «(^2 - h) {{N -i + l){M -N + k2 + ki) - 1) > 0. (63) 
Combining (|6T1) . (|62l) and (|63l) . we get dk2{r) > dk^{r). The proof of Lemma 2 is complete. 
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Appendix C 

We compare the spatial multiplexing gain r of the corner point k' (k' = 1, .... k — 1) on the 
DMT curve dk{r), i.e., r = {N — k')T{k) — {k — k')a, with the lower boundary of ^k-i, i-e., 
(N -k + l)r{k - 1), and get 

(N - k')T{k) -{k- k')a -{N-k + l)r(fc - 1) > ((A^ - A; + 1)(M - + 2A; - 1) - 1) a > 0. 

(64) 

If {N - k')T{k) -{k- k')a < N, it suffices to have {N - k')T{k) - {k - k')a e Vlk-i. 
Otherwise, we get {N — k')T{k) — {k — k')a ^ Vtk- Since Ofc-ifl^fc — and Vt^ C Vt^, both 
cases lead to r ^ Q,k- This completes the proof of Lemma 3. 
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Fig. 2. DMT in a 3 X 3 MIMO channel. Note tliat do(r) in tlie legend denotes do{r). 
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Fig. 5. Diversity gain at r = N versus channel quality a in a 5 x 3 MIMO channel. 



On the Achievable Diversity-Multiplexing Tradeoff in 
MIMO Fading Channels With Imperfect CSIT 

Xiao Juan Zhang and Yi Gong 



Abstract 

In this paper, we analyze the fundamental tradeoff of diversity and multiplexing in multi-input 
multi-output (MIMO) channels with imperfect channel state information at the transmitter (CSIT). We 
show that with imperfect CSIT, a higher diversity gain as well as a more efficient diversity-multiplexing 
tradeoff (DMT) can be achieved. In the case of multi-input single-output (MISO)/single-input multi- 
output (SIMO) channels with K transmit/receive antennas, one can achieve a diversity gain of d{r) = 
K{\ — r + Ka) at spatial multiplexing gain r, where a is the CSIT quality defined in this paper For 
general MIMO channels with M (M > 1) transmit and N (N > 1) receive antennas, we show that 
depending on the value of a, different DMT can be derived and the value of a has a great impact on 
the achievable diversity, especially at high multiplexing gains. Specifically, when a is above a certain 
threshold, one can achieve a diversity gain of d{r) = AIN{1 + MNa) — (Af + N — l)r; otherwise, 
the achievable DMT is much lower and can be described as a collection of discontinuous line segments 
depending on M, N, r and a. Our analysis reveals that imperfect CSIT significantly improves the 
achievable diversity gain while enjoying high spatial multiplexing gains. 
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Diversity-multiplexing tradeoff, MIMO, channel state information, channel estimation. 
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I. Introduction 

The performance of wireless communications is severely degraded by channel fading caused 
by multipath propagation and interference from other users. Fortunately, multiple antennas can 
be used to increase diversity to combat channel fading. Antenna diversity where sufficiently 
separated or different polarized multiple antennas are put at either the receiver, the transmitter, 
or both, has been widely considered On the other hand, multi-antenna channel fading 

can be beneficial since it can increase the degrees of freedom of the channel and thus can provide 
spatial multiplexing gain. It is shown in |l3l that the spatial multiplexing gain in a multi-input and 
multi-output (MIMO) Rayleigh fading channel with M transmit and N receive antennas increases 
linearly with min(M, A^) if the channel knowledge is known at the receiver. As MIMO channels 
are able to provide much higher spectral efficiency and diversity gain than conventional single- 
antenna channels, many MIMO schemes have been proposed, which can be classified into two 
major categories: spatial multiplexing oriented (e.g.. Layered space-time architecture JH), and 
diversity oriented (e.g., space-time trellis coding Q, |l6l, and space-time block coding |f8l). 

For a MIMO scheme realized by a family of codes {C(p)} with signal-to-noise ratio (SNR) p, 
rate R{p) (bits per channel use), and maximum-likelihood (ML) error probability Ve{p), Zheng 
and Tse defined in [19|| the spatial multiplexing gain r as r = limp^oo £^ and the diversity gain 
(i as (i = — limp^oo Under the assumption of independent and identically distributed (i.i.d.) 
quasi-static flat Rayleigh fading channels where the channel state information (CSI) is known 
at the receiver but not at the transmitter, for any integer r < min(M, A^), the optimal diversity 
gain d*{r) (the supremum of the diversity gain over all coding schemes) is given by [HI 

d*{r) = {M -r){N ~r) (1) 

provided that the code length L > M + N — 1. The diversity-multiplexing tradeoff (DMT) in ([T]) 
provides a theoretical framework to analyze many existing diversity-oriented and multiplexing- 
oriented MIMO schemes. It indicates that the diversity gain cannot be increased without pe- 
nalizing the spatial multiplexing gain and vice versa. This pioneering work has generated a 
lot of research activities in finding DMT for other important channel models [fT0|-[IT3| and 
designing space-time codes that achieve the desired tradeoff of diversity and multiplexing gain 
[fT4l - [fT6l . The DMT analysis was extended to multiple-access channels in [lOJ. The automatic 
retransmission request (ARQ) scheme is shown to be able to significantly increase the diversity 
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gain by allowing retransmissions with the aid of decision feedback and power control in block- 
fading channels [11]. The work in [12] investigated the diversity performance of rate-adaptive 
MIMO channels at finite SNRs and showed that the achievable diversity gains at realistic SNRs 
are significantly lower than those at asymptotically high SNRs. The impact of spatial correlation 
on the DMT at finite SNRs was further studied in [I13II . 

It is natural to expect that the DMT can be further enhanced through power and/or rate 
adaptation if the transmitter has channel knowledge. If the CSI at the transmitter (CSIT) is 
perfectly known, there will be no outage even in slow fading channels since it is always able 
to adjust its power or rate adaptively according to the instantaneous channel conditions. For 
example, it can transmit with a higher power or lower rate when the channel is poor and a lower 
power or higher rate when the channel is good. However, in practice the CSIT is almost always 
imperfect due to imperfect CSI feedback from the receiver or imperfect channel estimation at 
the transmitter through pilots. The work in [[TtII showed that the transmitter training through 
pilots significantly increases the achievable diversity gain in a single-input multi-output (SIMO) 
link. In [18], the authors quantified the CSIT quality as a = — log (Xg / log p, where is the 
variance of the CSIT error, and showed that using rate adaptation, one can achieve an average 
diversity gain of d{a,f) = (1 + a — f)K in SIMO/MISO links, where K = ma.x{M,N) and 
f is the average multiplexing gain. Note that setting a = 1 and ignoring the multiplexing gain 
loss due to training symbols directly yields the result in ifTTl . For general MIMO channels, the 
achievable DMT with partial CSIT is characterized in |[T9l , where the partial CSIT is obtained 
using quantized channel feedback. 

In this paper, we analyze the fundamental DMT in MIMO channels and show that with power 
adaptation, imperfect CSIT provides significant additional diversity gain over ([T]). The imperfect 
CSIT considered in this paper is due to channel estimation error at the transmitter side. In the 
case of MISO/SIMO channels, we show that with power adaptation (under an average sum 
power constraint), one can achieve a higher diversity gain than that with rate adaptation in [18], 
where the authors assumed peak power transmission and thus no temporal power adaptation is 
considered therein. Specifically, we prove that with a CSIT quality a, the achievable diversity 
gain is d{r) = K (1 — r + Ka). It has been shown in our earlier work [20] that this is actually 
the optimal DMT in SIMO/MISO channels with CSIT quality a. For general MIMO channels 
(M > 1, > 1), we show in this paper that depending on the value of a, different DMT 
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can be derived and the value of a has a great impact on the achievable diversity, especially at 
high multiplexing gains. Specifically, when a is above a certain threshold, one can achieve a 
diversity gain of d{r) = MN{1 + MNa) ~ {M + N - l)r; otherwise, the achievable DMT 
is much lower and can be described as a collection of discontinuous line segments depending 
on M, A^, r and a. It is noted that an independent and concurrent work recently reported in 
[|2T| shares some similar results. However, we wish to emphasize that our CSIT model and the 
involved analysis towards the achievable DMT are different from those in [|2T|. The noisy CSIT 
therein is based on the channel mean feedback model in [l22| and an example of obtaining CSIT 
through delayed feedback is provided, whereas the CSIT in our work is estimated from reverse 
channel pilots using ML estimation at the transmitter. As the variance of the channel estimation 
error is inversely proportional to the pilots' SNR ll23l . the CSIT quality a is naturally connected 
to the reverse channel power consumption and any value of a can be achieved by scaling the 
reverse channel transmit power. In addition, our paper provides detailed closed-form solutions to 
the achievable DMT, which offers great insight and depicts directly what the DMT curve with 
imperfect CSIT looks like. 

Notations: IZ^ denotes the the set of real A^-tuples, and IZ^^ denotes the set of non-negative 
A^-tuples. Likewise, C^^^^ denotes the set of complex N x M matrices. For a real number 
X, denotes max(x,0), while for a set C C TZ^, C+ denotes O fl TZ^^. denotes the 
complementary set of O and denotes the empty set. \0\ denotes the cardinality of set O. 
X G (a, b] denotes that the scalar x belongs to the interval a < x < b. Likewise, x E [a, b] is 
similarly defined. CJ\f{0, a"^) denotes the complex Gaussian distribution with mean and variance 
cr^. The superscripts * and 1^ denote the complex conjugate and conjugate transpose, respectively. 
II ■ \\p denotes the matrix Frobenius norm and 1^ denotes the N x N identity matrix. E{-} 
denotes the expectation operator and log(-) denotes the base-2 logarithm, /(p) = p'' denotes that 
b is the exponential order of /(p), i.e., limp_>oo log(/(p))/log(p) = b. Likewise, < is similarly 
defined. Finally, for matrix A, A ^ denotes that A is positive semidefinite; if y is used with 
a vector, it denotes the componentwise inequality. 

The rest of this paper is organized as follows. In section II, we describe the channel model. 
In section III, we propose a power adaptation scheme based on imperfect CSIT and present the 
main result on the achievable DMT. The achievability proof of the presented DMT is given in 
Section IV. Section V provides some discussions. Finally, Section VI concludes this paper. 
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II. Channel Model 

We consider a point-to-point TDD wireless link with M transmit and receive antennas, 
where the downlink and uplink channels are reciprocal. Without loss of generality, we assume 
M > in this paper. As shown in f9|, this assumption does not affect the DMT result. We 
also consider quasi-static Rayleigh fading channels, where the channel gains are constant within 
one transmission block of L symbols, but change independently from one block to another. We 
assume that the channel gains are independently complex circular symmetric Gaussian with zero 
mean and unit variance. The channel model, within one block, can be written as 

Y = ^P/MUX + W (2) 

where H = {hn-m} G C^^*^ with hn,m, rn = 1, 2, . . . , M, n = 1, 2, . . . , A^, being the channel 
gain from the m-th transmit antenna to the n-th receive antenna; X = {X^ i} E C*^^^ with 
Xm,i, m = 1,2,..., M, / = 1, 2, . . . , L, being the symbol transmitted from antenna m at time 
/; Y = {Yn,i} G C^""^ with Yn,i, n = 1, 2, . . . , iV, / = 1, 2, . . . , L, being the received signal at 
antenna n and time /; the additive noise W G C^^^ has i.i.d. entries Wn,i ~ CJ\f{0, cr^); P is the 
instantaneous transmit power while the average energy of Xm,i is normalized to be 1. Letting 
P denote the average sum power constraint, we have E{P} = P. So, the average SNR at the 
receive antenna is given by p = P/cr^. 

We assume that the receiver has perfect CSI H G C^^^, but the transmitter has imperfect 
CSIT H G C^^*^, which is estimated from reverse channel pilots using ML estimation. Thus, 
H can be modeled as ll23]l-[|25l 

H = H + E (3) 

where the channel estimation error E G C^^^ has i.i.d. entries En^m ~ CA/'(0,crg), n = 
1,2, . . . , N, m = 1,2, . . . , M, and is independent of H. The quality of H is thus characterized by 
cTg. If cTg = 0, the transmitter has perfect channel knowledge; if a"^ increases, the transmitter has 
less reliable channel knowledge. We follow ifTSl to quantify the channel quality at the transmitter. 
The transmitter is said to have a CSIT quality a, if = p^". The definition of a builds up 
a connection between the imperfect channel knowledge at transmitters and the forward channel 
SNR, p. Since the variance of the channel estimation error is inversely proportional to the pilots' 
SNR, i.e., cTg oc {SNRpUot)~^ ll23l . any value of a can be achieved by scaling the reverse channel 
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power such that SNRpHot = p°. One can see that the selection of a value actually determines 
the cost of obtaining CSIT in terms of the reverse channel power consumption. When o; = 0, 
the reverse channel SNR does not scale with p, which means that the pilot power is fixed or 
limited; when < « < 1, the reverse channel SNR relative to p is asymptotically zero; when 
a = 1, the reverse channel SNR scales with p at the same rate; when a > 1, the reverse 
channel SNR as compared to the forward channel SNR, p, is asymptotically unbounded [[T8l . In 
the sequel, we will study how the pilot power, or equivalently the CSIT quality a, affects the 
fundamental tradeoff of diversity and multiplexing in the considered channel. Before presenting 
our main results, we give the following probability density function (pdf) expressions and some 
preliminary results that will be used later. 

For m N X M (N < M) random matrix A with i.i.d. entries ~ CAf{0, 1), let < Ai < A2 < 
... < A AT denote the ordered nonzero eigenvalues of AA^ Letting t>„ denote the exponential order 
of 1/Xn for all n, the pdf of the random vector v = [vi, ...,vn] is given by [|26l 



N N / ^ \ 

p(v) = jiim r'llogp)"^ n p-^''~^-^^>- nip""" - p-'O'exp - p-'- 

n=l j>'n. \ n=l / 

(4) 

0, for any u„ < 

niLiP"^'""'^^^"^^"", for all Vn>Q 
where ^ is a normalizing constant. Hence, the probability Vo that (vi, ...,vn) belongs to set O 
can be characterized by 

N 

Vo = p~'^° , for do= inf V (2n - 1 + M - N)vn (5) 

ivi,...,VN)eo+ 

provided that C+ is not empty. 

Letting a = [oi, 02..., a^], < ai < a2 < ■■■ < aj^, b = [61, b2..., bj\f], < bi < b2 < ■■■ < b^, 
and c = [ci, C2..., Cat], < ci < C2 < ... < cat, denote the eigenvalue vectors of HH^ HH^ and 
EE^ respectively, the pdfs of a, b, and c can be shown to be 

N \ 

~ (6) 



N N / ^ \ 

p(a) = n n («n - exp - ^ aj 

n=l n<j \ n=l / 

p(b) = n c-^ n - b,r exp --^ 6„ (?) 

n=l n<j \ ^ ' n=l / 

P(C) = n Cn'-^ n - exp -- ^ C J (8) 

n=l n<j \ e n=l / 
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where = r'(l + a^)-*^^ and = r^^-'^-*'^ 



III. Main Result on DMT 

The ML error probability Ve{p) of the channel described in Q is closely related to the asso- 
ciated packet outage probability Tout, which is defined as the probability that the instantaneous 
channel capacity falls below the target data rate R{p). In fact, the error probability of an ML 
decoder which utilizes a fraction of the codeword such that the mutual information between 
the received and transmitted signals exceeds LR{p) (no outage), averaged over the ensemble of 
random Gaussian codes, can be made arbitrarily small provided that the codeword length L is 
sufficiently large ETl . We will thus leverage on the outage probability to examine the achievable 
diversity gain. If the transmitter has perfect CSIT, it may adopt the optimal power adaptation 
according to the actual instantaneous channel gain such that no outage will occur. With only the 
imperfect CSIT, in order to mitigate the channel uncertainty, we propose the following power 
adaptation scheme. 

Proposition 1: Given H, the transmitter transmits with power 

K P 

^(H) = , ^ u (9) 

where k = illn=i K'^^ - 1 + M - A^)(l - t)] and t (0 < t < 1) can be chosen arbitrarily close 
to 1. 

It is shown in Appendix A that the above power adaptation scheme satisfies the sum power 
constraint £'{P(H)} = P. We believe that given the CSIT quality of a, this power adaptation 
scheme is the optimal power adaptation scheme that maximizes the achievable diversity gain of 
a MIMO fading channel. 

Theorem 1: Consider a MIMO channel with M transmit and receive antennas (M > A^) 
and CSIT quality of a. If the block length L > M + A^ - 1, the achievable DMT using the 
power adaptation scheme in Proposition 1 is characterized by 

Case 1: If A^ = 1 or a > then 

d{r) = MN{1 + MNa) - {M + N -l)r. (10) 
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Case 2: Otherwise, the achievable DMT is a collection of discontinuous line segments, with 
the two end points of line segment dk{r) (k E B) given by 

Left end: 4(r) = k{M -N + k)T{k), for r = {N - k)T{k) 

Right end: 4(r) = {{N - k){k - N - I) + MN)T{k) - {2k - I + M - N){N - I{k))T{I{k)), 

for r = (A^- J(A;))r(J(A;)) 

(11) 

where 

B = [k\{M - N + k){N - k) < l/a,{N - k)T{k) < {N~k)T{k),yk <k,k = 1,...,n], 

T{k) = 1 + ka{M — N + k) and I{k) = max^gg_^<;. k. 

For example, when M = N = 2 and a < 1, the DMT curve consists of two discontinuous line 
segments which are (0, 16a + 4) — (1 + a, 13a + 1) and (1 + a, 1 + a) — (2, 2a). When r = 1 + a, 
the achievable diversity gain is d{r) = 1 + a instead of 13a + 1. From Theorem 1, we can get 

d{0) = MN{1 + MNa) and d{N) = pa{M - N + p){MN + {p - N){N - p + 1))) - + p 
where p = min^g/? k. If a < (jv-i)(M-Ar+i) ' which indicates 1 G i3, we will have d{N) = 
aN{M - N + 1)^. 

IV. Proof of Theorem 1 

The proof involves the computation of the asymptotic ML error probability at high SNRs. We 
will first derive a lower bound of the SNR exponent of the outage probability, denoted as do{r), 
and then show that using a random coding argument the SNR exponent of the error probability 
is no less than do{r) if L > M + N - 1. 

A. Derivation of do{r) 

Optimizing over all input distributions, which can be assumed to be Gaussian with a covariance 
matrix Q without loss of optimality, the outage probability of a MIMO channel with transmit 
power P(H) is given by 

Tout = inf , V flogdet (l^ + ^HQHt) < R{p)\ (12) 

where V{-) denotes the probability of an event. It is shown in [9] that one can get an upper bound 
and a lower bound on the outage probability by picking Q = Ia/ and Q = MIm, respectively. 
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and the two bounds converge in the high SNR regime. Therefore, without loss of generality, we 
consider Q = Im- Substituting ^ in (fT2l) . we have 



V, 



out 



V logdet + 



r (log n f 1 + — 



pKttr, 



HHt^ < R{p)^ 

-) < Rip)) ■ 



(13) 



tN i(2n-l+M-N)t 
ln=l 

Lemma 1: The eigenvalues of Hh\ HH^ and EE^ have the following relationship 

fen <2(a„ + c^), n = l,2,...,Ar. (14) 

Proof: We obviously have the following equality 

(H + E)(H + E)'^ + (H - E)(H - E)^ = 2(HH1' + EE^) (15) 

where both (H + E)(H + E)t and (H — E)(H — E)''" are positive semidefinite matrices. We denote 
the vector of eigenvalues of (HH^ + EE^) as d = [di, cLn] with di < d2 < ... < d^- Since the 
eigenvalues of the sum of two positive semidefinite matrices are at least as large as the eigenvalues 
of any one of the positive semidefinite matrices [[281, we have fe„ < 2dn, n = 1,2,...,N. 
Further, using the relationship of the eigenvalues of the sum of Hermitian matrices, we get 
an + ci < dn < an + cn, n = 1, 2, A^. It thus directly leads to (fT4|) . ■ 
With Lemma 1, the outage probability is upper bounded by 



N 



log n 1 



n=i V Mnti(2a„ + 2c^)(2-i+M-iv) 
Let Vn and u„ denote the exponential orders of l/a„ and l/c^, respectively, i.e., f„ 



(16) 



limp^oo 1^^- Using ©, © and the pdfs of the random vector 



V = [f 1, Vn] and u = [ui, un] can be shown to be 

0, for any f „ < 

nn=iP"('""'+*'^'^^"", for all Vn>0 



p{\) 



p(u) 



0, for any m„ < a 

nil p-(2«-i+M-^){«.-"), for all Un > a. 
At high SNRs, with ([IT]) and ([T8]), becomes 



(17) 



(18) 



Vout < V 



N 



El 



n=l 



N \ 

t{2n -1 + M - N) min(w„, un) 

n=l J 



< r 



(19) 
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So, the outage event O in (fT9l) is the set of {f i, . . . , f at, Mi, . . . , mat} that satisfies 

N / N 



N / N \ 

^ 1 - + 5^ t(2n -1 + M-N) minK,^^) 

n=l \ n=l / 



< r (20) 



where fn > 0, m„ > a > 0, n = 1, 2, A^. 
According to (|5]), we have 

Vout <Vo = p-"^^'') (21) 
where do^r) serves as a lower bound of the SNR exponent of Vout and is given by 

N 

do{r)= inf V(2n - 1 + M - A^) K + u„ - a) . (22) 

{v-i,...,vm,ui,...,um)€0 

Next, we work on the explicit expression of do{r). Since the left hand side (LHS) of (|20l ) 
is a non-decreasing function of ua^, decreasing un will not violate the outage condition in (|20|) 
while enjoying a reduced SNR exponent X]^=i (2?^^ — 1 + M — A^) (f„ + u„ — a). Combining with 
the fact M„ > a, n = 1, 2, iV, the solution of u is found to be = ... = = a. Therefore, 
(|20|) can be rewritten as 







/ AT 




E 


l-Vn + Yl ^(2^ - 1 + M 




n=l 


\ n=l 



To solve the optimization problem of (|22l) . we need to solve the subproblems 



A 



4(r) = inf V(2n-1 + M-Ar)t;„, A; = 0,l,...,Ar (24) 

(''iv,fjv)eCfc , 



^ n=l 

where subset Ok (0 < k < N) is defined as 





N 




N 


Ok = l^Vn 






h M - N)a + ^(2^ - 1 + M 




n=l 


\ n=l 


n=fc+l 



^^1 > ••• > ^^fe > a > ffe+i > ••• > vn} ■ 
So, do{r) is given by 

(ic)(r) = min {do{r), di{r),..., dN^r)) . (25) 

In other words, among all the DMT curves dQ(r),...,dNir), corresponding to the outage subsets 
Oi,...,On, the lowest one will be the DMT curve for the entire outage event. Since t can be 
made arbitrarily close to 1, it is without loss of accuracy to set t = 1 in the rest of this paper. 
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Firstly, we derive do{r). It is easy to show 

N / N \+ N N 

J2 l-t;„ + ^(2n-l + M-Ar)t;„ > A^-^ y„+A^ ^(2n-l + M-Ar)t;„ >N (26) 

n=l \ n=l I n=l n=l 

which suggests that it is possible to operate at spatial multiplexing gain r G [0, A^] reliably 
without any outage, i.e., do(r) = oo. So we can exclude do(r) from the optimization problem 
in (125]). 

Secondly, we derive 4(r) (1 < A; < A^). Note that the function XI^Li (l " + ELi(2^ - 1 
+M — N)a + Z]^=fc+i(2^ — 1 + M — N)vn^^ is an increasing function of Vk+i, ffc+2, v^. 
That is, decreasing Vk+i, Vk+2---,VN does not violate the outage condition for Ok, while reducing 
the SNR exponent — 1+^— ^)^^n- Therefore, the optimal solutions of ffc+i, ■■■,vn 

are vl^^ = vl^2 = ■■■ = v^^ = 0. Consequently, the optimization problem in (|24|) can be 
reformulated as 

k 

dk{r)= inf _ ^(2n - 1 + M - iV)?;„, A; = 1, iV. (27) 

(i>i,...,i)fe)edfc 

Here the modified outage subset Ok is defined as 



Ok = l^^i, Vk NT{k) -J2^Vn<r,a<Vk<...<Vi< r(fc)| . (28) 
where r(A;) = 1 + ka{M — N + k). Careful observation of (|28] ) reveals that 

k 

NT{k) -J2vn> (N - k)T{k). (29) 

n=l 

It implies that there will be no outage (dk{r) = oo), if r < (iV — k)T(k) or (A^ — k)T{k) > N. 
Note that {N-k)T{k) >N^ {M -N + k){N -k) > 1/a. So, if {M-N + k){N-k) < 1/a, 
there will be nonzero outage {dk{r) < oo), for r G where f2/c is defined as 

ilk = i{N-k)T{k), N]. (30) 

For any r G ^7^^ we are able to explicitly calculate the optimal solutions of Vi,...,Vk that 
minimize the SNR exponent I]n=i(2'^ — 1 + M — N)vn- The results are summarized in the 
following. 

1) If r = (A^ — k')T{k) — {k — k')a, k' = 1, 2, k, then the achievable diversity for outage 
event Ok is 

4(r) = k'{M -N + k')T{k) + {k - k'){k + k' + M - N)a. (31) 
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The corresponding optimal solutions of vi, ...v^ are vl = ... = v^, = rik), vl,j^^ = ... = 
vl = a. Specifically, 4(r) = k{M -N + k)T{k) for r = {N - k)T{k). 
2) If {N - k')T{k) -{k- k')a < r < {N - k' + l)r(A;) - {k - k' + l)a, k' = 1, 2, k, the 
achievable diversity for outage event Ok is 

4(r) = ((N-k')ik'-N-l)+MN)T{k) + {k-k' + l){k-k')a-{2k'-l+M-N)r. (32) 

The corresponding optimal solutions of vi,...,Vk are = ... = Vk'-i = T{k), v^i = 
{N -k' + l)T{k) - (/c - k')a - r, v*^,^-^ = ... = vl = a. 

For a particular k' , when spatial multiplexing gain r is between (A^ — k')T(k) — (k — k')a and 
(N — k' + 1)t(A;) — {k — k' + l)a, only one singular value of H, corresponding to the typical 
outage event, needs to be adjusted to be barely large enough to support the data rate. (|32|) further 
shows that dk{r) is a continuous line segment between these two points. It is thus obvious that 
curve dk{r) is piecewise-linear with (r,dk{r)) specified in (|3T1) being its corner points. 

After calculating di{r), dN{r), we remain to solve do{r) = min^ dk{r), k = 1, N. Since 
dk{r) = oo if (M — N + k){N — k) > 1/a, we only need to consider k E A, where set A is 
defined as 

^= {A;|(M-iV + A;)(A^- A;) < l/a,k= 1,...,A^}. (33) 

Note that we always have k = N G A. We consider the following two cases. 
Case 1: A has only one element, i.e., A = {k = N}. In this case, we have (io(r) = (ijv(r). If 
= 1, this condition is naturally satisfied, since there is only one element in A that is k = 1. 
If > 1, we must require (M - + /c)(A^ - A;) > 1/a for /c = 1, A^ - 1, which leads to 

a > TT^, N>1. (34) 
- M- 1 

We now examine the comer points of d^^r). From (|3T1) . we have r = (N — k'){l + aMN — 
a) > 1 + MNa - a for corner point k' (k' = 1,2, A^ - 1). Since 1 + MNa - a is a 
non-decreasing function of a, we easily get r > 1 + — -jj^ > N. Thus we conclude 
that there is only one corner point (0,(iiv(0)) on curve dN{r) over region r G ^In- Therefore, 
doi^) = dN{r) is a straight line between comer points (0,(ijv(0)) and (N,dN{N)). From (|32l) , 
we have dN{N) = MN{1 + MNa) - {M + N - 1)N, so do{r) can be described as 

do{r) = MN{1 + MNa) - {M + N - l)r for < r < A^. (35) 
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Case 2: A has more than one element. Since N e A and — [0, A'^], Qfc (k N,k e A) 
overlaps with il^- That is, there are some regions of spatial multiplexing gain r, leading to finite 
diversity gains on different DMT curves. A straightforward method to find doir) is to numerically 
calculate dk{r) for all k E A, and choose the minimum value among them. However, this makes 
do{r) implicit and hardly insightful. To find the closed-form solution of do{r), we wish to find 
out if there is any relationship among di(r), dN{r). This motivates the birth of the following 
Lemma, the proof of which is given in Appendix B. 

Lemma 2: For any spatial multiplexing gain r e n^fe2 (1 < ^i, ^2 < N), if ki < k2, we 
have 4^(r) < dt^ir). 

This Lemma tells us if a spatial multiplexing gain r leads to finite diversity gains on two 
DMT curves, we only need to select the curve with lower diversity gain. For example, if r e 
^1 n^2 n ■■■ n^AT, then do{r) — o?i(r) since di{r) < d2{r) < ... < dN{r) < 00. Therefore, we 
can further expurgate bad k (s.t. flk Q ^k, fork<kEA) from A and only take into account 
A; e B for the optimization problem, where 

B = [k\{N - k)r{k) <{N- k)r{k), Vk < k,k,k e A} . (36) 

Letting \B\ denote the cardinality of B, we further divide r G [0, iV] into \B\ non-overlapping 
regions with region ^1^ (k e B) defined as 

Qk = ^kO^l, yk KkkkeB 

(37) 

= [{N -k)T{k),{N -X{k))T{X{k))) 

where T{k) indicates the immediately preceding element of k in B, i.e., T{k) — max^^j. j^gg k. 
From Fig. 1, which illustrates the relationship between and Q^, we get do{r) — dk{r) for 
any r e Ctk. 

Next we examine the comer points on curve dk{r) over r e Qfe and give the following Lemma, 
the proof of which is given in Appendix C. 

Lemma 3: For k E B, there is only one corner point, {{N — k)T{k), k{M — N + k)T{k)), 
making r e VL^. 
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As a result, dk{r) over r G is just a single line segment connecting the following two end 
points 

Left end: 4(r) = k{M -N + k)T{k), for r = {N - k)T{k) 
Right end: 4(r) = {{N - k){k - N - 1) + MN)T{k) - {2k - 1 + M ~ N){N - I{k))T{I{k)), 
for r = (A^- J(A;))r(J(A;)). 

(38) 

Finally, since doir) is the union of dk{r) over r G for all k E B, the DMT curve over the 
entire outage event is the collection of all the involved line segments and the two end points of 
line segment dk{r) (k G B) are described in (l38l) . It should be noted that these line segments 
are discontinuous though r is continuous between and N. Combining the above Cases 1 and 
2 directly leads to (flOl) and (fTTj) in Theorem 1. 



B. Achievability Proof 

To complete the proof of the Theorem 1, we need to show that Ve{p) < p^'^oir) if > 
M + N — 1. With the ensemble of i.i.d. complex Gaussian random codes at the input, the ML 
error probability is given by [|9|| 

r^{p) = VoV{enor\0) + P(error, O") <Vo + P(error, O") (39) 

where O and Vq are given by (l20l) and (l22l) . respectively. 

^(error, C^) can be upper-bounded by a union bound. Assume that X(0), X(l) are two possible 
transmitted codewords, and that AX = X(l) — X(0). Suppose X(0) is transmitted, the probability 
that an ML receiver will make a detection error in favor of X(l), conditioned on a certain 
realization of the channel, is 



V (X(0) ^ x(r 



H,H 



V 



'P(H) 



^H(AX) 



< llwl 



(40) 



where w is the additive noise on the direction of H(AX), with variance 1/2. With the standard 
approximation of the Gaussian tail function, Q{x) < 1/2 exp(— x^/2), we have 

P(H) 



P X(0) ^X(l) H,H < exp 



|H(AX)||^ 



(41) 
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Averaging over the ensemble of random codes, we have the average pairwise error probabiUty 
conditioned on the channel realization 



L 



V (X(0) ^ X(1)|h, h) < det {\^ + ^^HHt j . (42) 

With a data rate R = r log(p) (bits per channel use), we have in total p^** codewords. Applying 
the union bound, we have 

P(error|H, H) < p^'' det f I^v + ^^HrA 



2Ma^ 

N / \ -L 

1+M-N 



(43) 



Mnti(2a„ + 2c^)2n-i+M-iv^ 

^(Er=i(l-''"+ELi(2"-l+J^-^)mi>iK.«iv))^- 



n=l 

^iV 



= P 

Averaging over the distributions of H and H, or equivalently v and u, we have 
P(error, C'^) = / p(u)p(v)7^(error|H, H)(iu(iv 

-L. p-dair) 

where 

N 

dair)^ inf y"(2n - 1 + M - N)(vn + - a) 

n=l 

/ N / N V \ 

+ L \ - t^n + X](2n -1 + M-N) min(^;„,'U;v)j -rj . 

When L > M+N-1, dair) has the same monotonicity as En=i (l - + En=i(2ri - 1 + M 
—N)a + E^=fe+i(27T' — 1 + M — N)vn^'^ with respect to Vn or n — 1, N. Therefore, the 
minimum always occurs when E^=i (l — I'n + E^=i(2n — 1 + M — N) min(i;„, un)J~^ — r. 
Hence 

AT 

dG{r)= inf ^ Y.{2n - 1 + M - N){vn + Un - a) 

Er=i(l-''"+Er=i(2'»-l+^-^)mmK,nw))'^=rn=l (46) 

= rfo(r). 
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Therefore, the overall error probability can be written as 

= p-'^o^") +P(error,0'=) (47) 

Since the MIMO channel with the proposed power adaptation scheme leads to an error 
probability lower than or equal to p~"'o('), we can say that the MIMO channel is able to achieve 
the diversity gain of do{r). Theorem 1 is thus obtained. 

V. Discussions 

In this section, we discuss the additional diversity gain Arf(r) brought by the imperfect CSIT 
through power adaptation. 

Case A) iV = 1 (MISO/SIMO): According to GO]), the imperfect CSIT provides an additional 
diversity gain of Arf(r) = M^a at any spatial multiplexing gain in the considered MISO/SIMO 
channel. Most remarkably, when a = 1/M, one can achieve both full diversity gain (i.e., M) and 
full spatial multiplexing gain (i.e., 1) at the same time, while a has to be equal to or greater than 
1 to achieve the same performance in ifTSl . Note that the case of « < 1 is much more practical 
than the case of a > 1 as one usually allocates much less power to the reverse (feedback pilot) 
channel than the forward transmission channel. 

Case B) a > -jjzi, N > 1: For such MIMO channel, according to (flOl) . the additional 
diversity gain is Ad{r) = {W^N^a + r - r^), for r = 0, 1, A^. Specifically, Ad{0) = M'^N'^a 
and Ad{N) = aM'^N'^ + N - > MN'^ + N. If < r < N, the additional diversity gain is 
between the two extreme values Arf(O) and Ad{N). 

Case C) a < A^ > 1: When r = N, AdiN) = d{N) > di{N) = aN{M-N+iy. When 

r < N, for the convenience of comparison with dH, we consider integer spatial multiplexing 
gains, i.e., r = N — k, k = 1,2, N. Since r = N — k<{N — k)T(k), from Theorem 1, the 
achievable diversity gain is d{r) > k{M-N+k)T{k) = {M-r){N-r) (1 + a{M - r){N - r)). 
Recall that the optimal diversity gain without CSIT is d*{r) = (M — r)(N — r). Therefore, the 
additional achievable diversity gain with our scheme is Arf(r) > a{M —r^ {N —rY = a ((i*(r))^. 
It indicates that even a very small a leads to a significant diversity gain improvement. 

We use numerical results to show the additional diversity gain achieved with imperfect CSIT. 
We compare the following two scenarios: 1) No CSIT [9]; and 2) Imperfect CSIT with power 
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adaptation. Figs. 2 and 3 plot the DMT curves for 3x3 and 4x2 MIMO fading channels, 
respectively. It is obvious that imperfect CSIT provides significant additional diversity gain 
improvement and offers non-zero diversity gain at any possible spatial multiplexing gain. Fig. 2 
also shows the impact of a value. When a > = i, we only have d^ir) < oo and thus 
B = {3}. Therefore, the DMT curve is a single line segment. However, when a = | < jj^, 
B = {1, 2, 3}. Therefore, the DMT curve is made up of three discontinuous line segments. Fig. 3 
shows how do{r) depends on di{r) and d2{r) in a 4 x 2 MIMO channel with a = 0.1. We 
observe that d2{r) > di(r) and there is only one comer point on di{r) (or d2{r)) over spatial 
multiplexing gain region r E Cli (or r E (I2). 

Next we illustrate the impact of a on DMT. Fig. 4 plots the relationship between the achievable 
diversity gain and the channel quality a in a MISO/SIMO channel. It clearly shows that power 
adaptation makes better use of the imperfect CSIT than rate adaptation. In other words, to 
achieve the same performance our scheme saves a great amount of pilot power and thus is 
more applicable. Specifically, the diversity gain improvements over [|9l and [fTSll are M^a and 
(M — l)Ma, respectively, at any spatial multiplexing gain. It is no doubt that the achievable 
DMT increases with CSIT quality a. Fig. 5 plots how the achievable diversity gain with power 
adaptation improves with the channel quality a in a 5 x 3 MIMO channel at the full multiplexing 
gain. We observe that there are fast increases of diversity gain at a = 0.25 and a = 0.1667. These 
two values of a are actually thresholds for dk{r) < 00, k = 1,2,3. When a > = 0.25, 
B = {3}. Therefore, we have do{N) = d^^N). When 0.1667 <a< 0.25, we have B = {2,3} 
and do{N) = d2iN). When a < 0.1667, we have B = {1, 2, 3} and rfo(r) = di{N). Combining 
with the fact that di{r) < d2{r) < d^{r) for any fixed a, it is not difficult to understand the 
cliffs on this curve. 

Note that the additional diversity gain comes at the price of reverse channel pilot power to 
obtain the CSIT. As long as the reverse channel SNR does not scale with p, i.e., a = 0, even 
with some partial CSIT at the transmitter, there will be no improvement on the fundamental 
DMT. However, when the reverse channel SNR relative to p becomes asymptotically zero, i.e., 
a < 1, there will be a significant improvement of the diversity gain. When the reverse channel 
SNR as compared to the forward SNR is asymptotically unbounded, i.e., a > 1, one can achieve 
the full spatial multiplexing gain while enjoying a even more remarkable diversity. 
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VI. Conclusion 

In this paper, we investigated the impact of CSIT on the tradeoff of diversity and spatial 
multiplexing in MIMO fading channels. For MISO/SIMO channels, we showed that using power 
adaptation, one can achieve a diversity gain of d{r) — K{1 — r + Ka), where K is the number 
of transmit antennas in the MISO case or the number of receive antennas in the SIMO case. 
This is not only higher but also more efficient than the available results in literature. For general 
MIMO channels with M > 1 transmit and N > 1 receive antennas, when a is above some 
certain threshold, one can achieve a diversity gain of d{r) = MN(1 + MNa) — (M + — l)r; 
otherwise, the achievable DMT is a collection of discontinuous line segments depending on M, 
N, r and a. The presented DMT shows that exploiting imperfect CSIT through power adaptation 
significantly increases the achievable diversity gain in MIMO channels. 

Appendix A 

Letting = — log(6„)/log(p) for all n and q = [qi, q^], we have 

/• kP . ^ ^ / 1 ^ \ 

■''i^^ {U^^^p-(^^-^+^-^)l-) n=l n<j V l + ^e;^l / 

(48) 

At high SNRs, it is easy to show that 

„ _ / N \ (1-*) 

E{P{il)} = lim / KPC\logpf n dq = P. (49) 

JqtO Vn=l / 

Appendix B 

Let t'A;i,fci denote the solutions of vi, that minimize dk-^{r), and let Vi^k2i '^k2M 

denote the solutions of Vi, that minimize dkr^{r). Without loss of generality, we assume 

vi,ki = ■■■ = t^i-i.fci = T{ki),T{ki) > Vify > a, Vi+i,ki = ■■■ = Vk^^ki = « (50) 

Vim = ■■■ = Vj-i,k2 = T{k2),T{k2) > > a,Vj+iM = ... = Vk2M = ^- (51) 

It follows that the corresponding spatial multiplexing gain r satisfies 

r^{N-i + l)T(fci) - (fci - i)a - Vi,k, (52) 

r^{N-j + l)T{k2) - {k2 - j)a - (53) 
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which leads to 

{{N-J + l)T{k2) - ih - j)a - -{{N-t + l)T{h) - {k, - ^)a - v^^kA = 0. (54) 

We consider the following three cases. 
Case I) j < i: Letting B denote the LHS of dH]), we have 

B >{N - j)r(/e2) - {k2 - j)a - {N - i + l)r{k{) - {ki - i + l)a 

>{N - % + l)r(A:2) - (A;2 - i + l)a - (iV - 2 + l)r(A;i) - (fci - i + \)a (55) 

>(x{k2 - ki) {{N -i + l){M - N + k2 + ki) - 1) > 0. 
This contradicts with 5 = 0. Therefore, j < z is not possible. 
Case 2) j > i: It is easy to observe that 

Vim - = ••• = ^i-i,fe2 - Vi-iM = k2a{M - N + k2) - kia{M - N + ki) > 0(56) 

ViM - > k2a{M -N + k2)- kia{M - N + k{) > Q (57) 

Vi+iM - Vi+iM, ■■■.Vk^M - VkiM > a - a = (58) 

Wfci+i,fc2, ■■■,Vk2,k2 > a. (59) 
Then, it follows that 

fc2 kl 

dk2 (r) - 4i {r) = ^iM - ^i^k^ > 0. (60) 

i=l 1=1 

Case 3) j = i: Similarly, we have 

^^i,fc2 - Vim = ■■■ = ^i-iM - Vi-iM = haiM - iV + /ca) - kia{M - N + ki) > 0(61) 
Vi+i,k2 = ■■■ = Vk2,k2 = Vi+iM = ••• = Vk-iM = ot- (62) 
From ( |54l ). we get 

ViM - ViM = «(^2 - h) {{N -i + l){M -N + k2 + ki) - 1) > 0. (63) 
Combining (|6T1) . (|62l) and (|63l) . we get dk2{r) > dk^{r). The proof of Lemma 2 is complete. 
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Appendix C 

We compare the spatial multiplexing gain r of the corner point k' (k' = 1, .... k — 1) on the 
DMT curve dk{r), i.e., r = {N — k')T{k) — {k — k')a, with the lower boundary of ^k-i, i-e., 
(N -k + l)r{k - 1), and get 

(N - k')T{k) -{k- k')a -{N-k + l)r(fc - 1) > ((A^ - A; + 1)(M - + 2A; - 1) - 1) a > 0. 

(64) 

If {N - k')T{k) -{k- k')a < N, it suffices to have {N - k')T{k) - {k - k')a e Vlk-i. 
Otherwise, we get {N — k')T{k) — {k — k')a ^ Vtk- Since Ofc-ifl^fc — and Vt^ C Vt^, both 
cases lead to r ^ Q,k- This completes the proof of Lemma 3. 
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Fig. 2. DMT in a 3 X 3 MIMO channel. Note tliat do(r) in tlie legend denotes do{r). 
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Fig. 3. DMT in a 4 X 2 MIMO channel with a = 0.1. Note that do(r) in the legend denotes do{r) 
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Fig. 4. Diversity gain versus channel quality q in a SIMO/MISO channel. 
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Fig. 5. Diversity gain at r = N versus channel quality a in a 5 x 3 MIMO channel. 



